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The organization of nucleosomes influences transcriptional activity by controlling accessibility of DNA binding proteins 
to the genome. Genome-wide nucleosome binding profiles have identified a canonical nucleosome organization at gene 
promoters, where arrays of well-positioned nucleosomes emanate from nucleosome-depleted regions. The mechanisms of 
formation and the function of canonical promoter nucleosome organization remain unclear. Here we analyze the ge- 
nome-wide location of nucleosomes during zebrafish embryogenesis and show that well-positioned nucleosome arrays 
appear on thousands of promoters during the activation of the zygotic genome. The formation of canonical promoter 
nucleosome organization is independent of DNA sequence preference, transcriptional elongation, and robust RNA 
polymerase II [Pol II) binding. Instead, canonical promoter nucleosome organization correlates with the presence of 
histone H3 lysine 4 trimethylation (H3K4me3) and affects future transcriptional activation. These findings reveal that 
genome activation is central to the organization of nucleosome arrays during early embryogenesis. 



[Supplemental material is available for this article.] 

Nucleosome organization affects transcriptional activity by influ- 
encing the access of DNA binding proteins to the genome (Jiang 
and Pugh 2009b). Recent studies in various model organisms have 
shown that nucleosome organization is not random in many 
functionally important regions of the genome (Yuan et al. 2005; 
Albert et al. 2007; Lee et al. 2007; Mavrich et al. 2008a,b; Schones 
et al. 2008; Shivaswamy et al. 2008; Valouev et al. 2008, 201 1; Jiang 
and Pugh 2009b; Teif et al. 2012). In particular, nucleosome- 
depleted regions have been observed near the transcription start 
sites (TSSs) of many genes. Moreover, arrays of positioned nu- 
cleosomes often emanate from nucleosome-depleted regions 
(Albert et al. 2007; Valouev et al. 2008, 2011; Jiang and Pugh 
2009b). Studies in yeast have indicated that nucleosome pat- 
terns are influenced (Segal et al. 2006; Kaplan et al. 2009), but 
not determined (Zhang et al. 2009; Stein et al. 2010; Weiner et al. 
2010), by DNA sequence. ATP-dependent chromatin remodelers 
(Gkikopoulos et al. 2011; Zhang et al. 2011) and transcription 
initiation and elongation (Zhang et al. 2009; Weiner et al. 2010; 
Hughes et al. 2012) have also been implicated in the organization 
of nucleosomes in vivo. 
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Although nucleosome organization profiles have been gener- 
ated for vertebrate organisms (Hu et al. 2011; Li et al. 2011; Valouev 
et al. 201 1; Gaffney et al. 2012; Nekrasov et al. 2012; Teif et al. 2012), 
it remains unclear how and when canonical promoter nucleosome 
structures form (Bai and Morozov 2010). Here, we address these 
questions by analyzing changes in nucleosome organization during 
the maternal-zygotic transition in zebrafish embryos. In zebrafish, 
the bulk of zygotic transcription begins 3 h post fertilization (hpf), 
when a period of rapid synchronous cleavages ends and cell cycles 
lengthen (Kane and Kimmel 1993; Schier 2007; Tadros and Lipshitz 
2009). Specific histone modifications appear during this transition 
(Vastenhouw et al. 2010; Lindeman et al. 2011), resulting in a 
chromatin signature characteristic of embryonic pluripotency 
(Vastenhouw et al. 2010). Therefore, the maternal-zygotic transition 
provides an ideal in vivo system to study the relationship among 
transcriptional activity, specific histone methylation marks, and 
nucleosome organization in vivo. 

Results 

Emergence of well-positioned nucleosome arrays 
during genome activation 

To analyze nucleosome organization during genome activation, 
we combined micrococcal nuclease (MNase) digestion of chro- 
matin with high-throughput sequencing (MNase-seq) (Supple- 
mental Fig. S1A-C). MNase preferentially cuts within linker DNA, 
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so the sequenced fragments indicate the positions of nucleosomes 
when they are mapped back to the genome. We generated two 
biological replicates for two developmental stages, resulting in 
a total of 487 and 519 million mapped reads for the 256-cell (2.5 
hpf; before the maternal-zygotic transition) and dome/30% epib- 
oly stages (4.5 hpf; after the maternal-zygotic transition) (Supple- 
mental Fig. S1D,E), respectively. This represents over 50-fold ge- 
nome coverage, one of the deepest nucleosome coverages of 
a vertebrate genome investigated to date. 

We analyzed three major aspects of nucleosome organization: 
occupancy (nucleosome density), rotational positioning (orienta- 
tion of the DNA helix on the nucleosome surface), and translational 
positioning (relative location of nucleosomes along the DNA). While 
few changes were detected for nucleosome occupancy or rotational 
positioning during the maternal-zygotic transition (for details, see 
Supplemental Fig. S2; Supplemental Text 1), changes in translational 
positioning of nucleosomes were widespread. By calculating the 
positioning degree as previously defined (Zhang et al. 2009), we 
found that during genome activation, arrays of well-positioned nu- 
cleosomes appear at many genomic loci (for example, see Fig. 1A). 
Indeed, the number of arrays with five or more well-positioned nu- 
cleosomes increased over 10-fold from the 256-cell stage (996 re- 
gions) to the dome stage (10,523 regions). By analyzing the relative 
locations of all nucleosomes at both developmental stages, we found 
that nucleosomes are on average more regularly positioned at the 
dome stage than at the 256-cell stage (Fig. IB). 



Nucleosome organization in vivo is known to be affected by 
DNA sequence preference (Supplemental Text 1; Segal et al. 2006; 
Kaplan et al. 2009; Struhl and Segal 2013). To examine the role of 
sequence preference on the appearance of nucleosome anays, we 
applied a computational model that predicts nucleosome organiza- 
tion from DNA sequence (Kaplan et al. 2009) to the regions with well- 
positioned nucleosome anays at the dome stage (after genome acti- 
vation). The observed and predicted nucleosome organization shows 
very weak correlation at the dome stage (Supplemental Fig. S3A,C), 
while for the same genomic regions, the sequence-based prediction 
performs slightly better at the 256-cell stage (before genome activa- 
tion) (Supplemental Fig. S3B). This raises the possibility that the 
formation of nucleosome arrays during genome activation works 
against the DNA sequence preference of nucleosomes. 

To investigate the potential functional relevance of well-po- 
sitioned nucleosome arrays at the dome stage, we examined their 
genomic distribution. Well-positioned nucleosome arrays are highly 
enriched at promoters (Fig. 1C). Indeed, the average promoter nu- 
cleosome profile across all genes shows a clear nucleosome-depleted 
region and positioned nucleosome arrays emanating from the TSS 
(Fig. ID). This type of canonical promoter nucleosome organiza- 
tion has been reported in other organisms (Jiang and Pugh 2009a; 
Radman-Livaja and Rando 2010). Gene Ontology (GO) analysis 
revealed that several GO categories related to basic cellular func- 
tions are significantly overrepresented in the genes with well- 
positioned nucleosome arrays at their promoters (Supplemental 
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Figure 1. Well-positioned nucleosome arrays are established at promoters during genome activation. (A) Sample profile of nucleosome organization 
around the promoter of supt6h at the 256-cell (blue, before the maternal-zygotic transition) and dome (green, after the maternal-zygotic transition) 
stages. Gaussian smoothing of midpoints of nucleosomal DNA was used to create the nucleosome profile. (B) Relationship between the position of 
adjacent nucleosomes before (blue) and after (green) the maternal-zygotic transition. The total number of nucleosomal sequencing reads with a start-to- 
start distance equal to the value on the x-axis is shown. (C) Genomic distribution of well-positioned nucleosome arrays at the 256-cell and dome stages. 
Promoters: regions from 2 kb upstream of the TSS to 2 kb downstream from the TSS. Gene bodies: regions from 2 kb downstream from the TSS to the 
transcription termination sites (TTS). Intergenic regions: all other genomic regions. (D) Average promoter nucleosome organization across all genes before 
(blue) and after (green) the maternal-zygotic transition. All reads were extended to 147 bp; the middle 73 bp were taken, piled up, and normalized by 
sequencing coverage. The +1, +2, +3, and -1 nucleosomes around TSS at the dome stage are indicated. 
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Fig. S4A). Furthermore, genes with these arrays at their promoters are 
more conserved across eukaryotic species (Supplemental Fig. S4B) 
and enriched for housekeeping genes (Supplemental Fig. S4C). 
Analysis of the nucleosome anays observed at the 256-cell stage 
revealed a statistically significant but weaker enrichment at the pro- 
moters (Fig. 1C). The average promoter profile at the 256-cell stage 
also shows weaker canonical nucleosome organization (Fig. ID), and 
no GO categories were significantly enriched. Taken together, our 
results reveal that well-positioned nucleosome arrays emerge at the 
promoters of housekeeping genes during genome activation. 

Co-occurrence of well-positioned nucleosome arrays 
with H3K4me3 marks 

Because well-positioned nucleosome arrays at the dome stage are 
highly enriched at promoters, we investigated the potential re- 
lationship between well-positioned nucleosome arrays and histone 
marks associated with transcription. We performed nucleosome- 
resolution ChlP-seq of H3K4me3, H3K27me3, and H3K36me3, 
and ChlP-seq of RNA polymerase II (Pol II) with two antibodies at 
the dome stage (Supplemental Table SI; Supplemental Fig. S5A,B), 
when these marks are well-established (Vastenhouw et al. 2010; 
Lindeman et al. 2011). Interestingly, 91% of nucleosome arrays at 
promoters were marked by H3K4me3 (Fig. 2A,B), and H3K4me3 
enrichment was stronger in promoters with well-positioned nu- 
cleosome arrays than in those without (Supplemental Fig. S6A). 



These results suggest a potential quantitative relationship between 
H3K4me3 and nucleosome arrays at the promoters. Indeed, when 
we ranked genes by their promoter H3K4me3 signals, those with 
better rank had stronger canonical promoter nucleosome organi- 
zation (Fig. 2C). To more accurately assess the quantitative re- 
lationship between the level of nucleosome arrays and different 
marks, we assigned a "nucleosome array value" to each promoter 
(for details, see Methods). Overall, nucleosome array values at pro- 
moters showed stronger correlation with H3K4me3 enrichment at 
the promoters (Fig. 2D) than with either H3K27me3 enrichment at 
the promoters (Supplemental Figs. S6B,E, S7A), Pol II signals at the 
promoters and gene bodies (Supplemental Figs. S6C,F,G, S7C,D), or 
H3K36me3 enrichment at the gene bodies (Supplemental Figs. 
S6D,H, S7B). To determine whether there is also a conelation of 
positioned nucleosome arrays with promoter H3K4me3 in mam- 
malian pluripotent cells, we analyzed the quantitative relationship 
between the level of nucleosome arrays and different marks in 
mouse embryonic stem cells. While the differences between marks 
were moderate, we found similar relationships as in zebrafish 
(Supplemental Fig. S8). 

Formation of canonical nucleosome organization in the 
absence of transcriptional elongation 

Nucleosome arrays have been reported to result from transcriptional 
activity (Weiner et al. 2010; Hughes et al. 2012). We therefore ana- 
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Figure 2. Well-positioned nucleosome arrays and H3K4me3 co-occur at promoters at the dome stage. (A) Sample profile of nucleosome organization 
(green) and H3K4me3 (red) around mrp/76and mrpl39. Gaussian smoothing of the midpoints of nucleosomal DNAwas used to represent the nucleosome 
profile. Identified well-positioned nucleosome arrays are indicated by black bars. (B) Venn diagram showing the overlap of well-positioned nucleosome 
arrays and H3K4me3 peaks at promoters. A 1 00-bp overlap was used as a minimal required cutoff. (C) Heatmaps of nucleosome organization; H3K4me3, 
H3K27me3, and RNA Pol II signals around TSSs; as well as the H3K36me3 signal in the last 2/3 regions of the concatenated exons. Genes are ranked by the 
H3K4me3 signal at their promoter. Each horizontal line represents the average signal for 100 genes. Color represents RPKM value. Genes were evenly 
grouped into 20 bins based on H3K4me3 density at their promoters, and the associated distribution of nucleosome array values at these promoters is given 
in the boxplot. In heatmaps, short genes (<1 kb) were excluded, and if one gene has multiple annotations, only the one with the strongest H3K4me3 signal 
was kept. A total of 18,890 genes was used. (D) The correlation between H3K4me3 density and nucleosome array value at the promoters. H3K4me3 
density is calculated as the log 2 transformed RPKM + 1 for each promoter. 
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lyzed whether well-positioned nucleosome arrays at promoters are 
specifically associated with genes that undergo transcriptional 
elongation. The early zebrafish embryo is loaded with maternal 
transcripts, precluding the identification of transcription status 
from standard RNA-seq or expression microarray data. We used 
three approaches to determine the transcription elongation 
status of genes. First, we used a combination of H3K36me3 
ChlP-seq enrichment in exons and Pol II ChlP-seq enrichment in 
gene bodies to distinguish transcribed from nontranscribed genes 
at the dome stage (for details, see Supplemental Text 2; Supple- 
mental Fig. S9). Surprisingly, although nucleosome occupancy is 
lower for nontranscribed genes, both transcribed and nontranscribed 
genes show similar canonical promoter nucleosome organization at 
the dome stage using this approach (Fig. 3A; Supplemental Fig. 
S11A,D,G). In a second approach, we used a combination of 
H3K36me3 ChlP-seq enrichment in exons and Pol II traveling ratio 
(Reppas et al. 2006; Rahl et al. 2010) to distinguish transcribed from 
nontranscribed genes (for details, see Supplemental Text 2). This 
approach also revealed clear canonical promoter nucleosome or- 
ganization in nontranscribed genes at the dome stage (Supple- 
mental Fig. S12A-D). In a third approach, we used the classifica- 
tion of a recent study that defined transcription status by 
distinguishing between parental alleles (Harvey et al. 2013). Using 
this approach, we also found clear canonical promoter nucleosome 
organization in nontranscribed genes at the dome stage (Supple- 
mental Fig. S12E-H). 

In a previous study, we classified three types of non- 
transcribed genes at the dome stage based on histone marks in pro- 
moters: bivalent (H3K4me3/H3K27me3), monovalent (H3K4me3), 
and nonmarked genes (neither H3K4me3 nor H3K27me3) (Vast- 
enhouw et al. 2010). We analyzed the nucleosome organization 
patterns for these classes of genes and found the canonical nucle- 
osome organization to be present at the promoters of bivalent and 
monovalent genes, but absent from nonmarked genes (Fig. 3B). 
Together, these findings suggest that the relationship between the 
establishment of well-positioned nucleosome arrays and H3K4me3 
during the maternal-zygotic transition is largely independent of 
transcriptional elongation. 

Promoter nucleosome organization in the absence 
of robust Pol II binding 

Pol II has been suggested to contribute to the formation of well- 
positioned nucleosome arrays in promoters (Hughes et al. 2012). 
To assess the effect of Pol II on nucleosome positioning, we clas- 
sified Pol II at promoters into elongating and nonelongating based 
on the combination of promoter Pol II enrichment, gene body 
H3K36me3 enrichment, and Pol II traveling ratio (for details, see 
Supplemental Text 3; Supplemental Fig. S13A,B). Consistent with 
previous findings (Yin et al. 2011), the location of the +1 nucleo- 
some is shifted downstream in promoters with elongating Pol II 
compared to nonelongating Pol II (Fig. 3C, red curve vs. blue 
curve). Moreover, genes with the strongest Pol II signal in 
their promoter have weaker nucleosome arrays (Supplemental 
Fig. S13C-F), which has been observed previously (Gilchrist 
et al. 2010) and is potentially a reflection of rapid nucleosome 
turnover. 

Nucleosome array values at the promoters are better corre- 
lated with H3K4me3 than with Pol II (Fig. 2D; Supplemental Fig. 
S6F), and, consistent with our previous study (Vastenhouw et al. 
2010), many promoters with high H3K4me3 have undetectable 
levels of Pol II (Supplemental Fig. S5C). This motivated us to ex- 
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Figure 3. Canonical nucleosome organization at promoters in the ab- 
sence of transcriptional elongation. (A) Average promoter nucleosome 
organization for transcribed (red) and nontranscribed (blue) genes. 
Transcription status is inferred from H3K36me3 ChlP-seq enrichment in 
exons and RNA Pol II ChlP-seq enrichment in gene bodies. (B) Average 
promoter nucleosome organization for three types of nontranscribed 
genes: bivalent (marked by both H3K4me3 and H3K27me3 at promoters; 
red), monovalent (marked by H3K4me3, but not by H3K27me3 at pro- 
moters; blue) and nonmarked (marked neither by H3K4me3 nor by 
H3K27me3 at promoters; black) genes. (C) Average promoter nucleo- 
some organization for three types of genes: with elongating Pol II (red), 
with nonelongating Pol II (blue), and with H3K4me3 at promoters but 
without Pol II (black). (A-Q All reads were extended to 1 47 bp; the middle 
73 bp were taken, piled up, and normalized by sequencing coverage. 
Number of genes used to draw an average profile was indicated for each 
class. Short genes (<1 kb) were excluded. The accompanying nucleosome 
profiles for each class of genes in the normalized version and at the 256- 
cell stage are shown in Supplemental Figure S10. 



amine nucleosome organization at promoters that are marked by 
H3K4me3 but lack Pol II signals. Interestingly, those promoters still 
formed well-positioned nucleosome arrays (Fig. 3C, black curve). 
We examined the possibility that some genes without Pol II are 
bound by Pol II very weakly. As the average Pol II signal shows 
a tiny hump at the TSS of genes in this group (Supplemental Fig. 
SI II), we selected a subset of promoters in this group for which the 
Pol II signal is below the genomic average and analyzed their nu- 
cleosome profile. This more stringently selected subgroup of pro- 
moters still has well-positioned nucleosome arrays (Supplemental 
Fig. SI 4). Importantly we observed a similar pattern of well-posi- 
tioned nucleosome arrays at promoters in mouse embryonic stem 
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cells that are marked by H3K4me3 but lack detectable Pol II 
binding (Supplemental Fig. S15C). Although very low, transient, or 
undetectable Pol II binding might be involved in nucleosome 
positioning, our data reveal that robust binding by Pol II is not 
required for the establishment of well-positioned nucleosome ar- 
rays at promoters during the maternal-zygotic transition. 

Correlation of early promoter nucleosome organization 
with later transcription 

Nucleosome organization in promoters is proposed to facilitate 
recognition by the transcriptional machinery (Hughes et al. 2012; 
Yen et al. 2012). Since canonical nucleosome organization at the 
promoters consists of nucleosome-depleted regions and arrays of 
positioned nucleosomes, we investigated whether these features 
have any predictive value for gene activation. When classifying 
genes into five groups based on their level of promoter nucleosome 
depletion at the 256-cell stage (Supplemental Fig. S16A; for details, 
see Supplemental Methods), we did not observe a significant re- 
lationship between promoter nucleosome depletion and H3K4me3 
at promoters or gene transcription at later stages (Supplemental Fig. 
SI 7). In contrast, when genes are classified based on nucleosome 
array value at the 256-cell stage (for details, see Methods; Supple- 
mental Fig. S16B), those with higher promoter nucleosome array 
values at the 256-cell stage have higher levels of promoter H3K4me3 
(Fig. 4A) and are more likely to be transcribed at the dome stage (Fig. 
4B; Supplemental Fig. S18). To substantiate this finding, we ana- 
lyzed an additional developmental stage by generating ChlP-seq 
data for H3K4me3 at an earlier developmental stage (oblong; 3.5 
hpf) (Supplemental Table SI). Indeed, better nucleosome array 
values at the 256-cell stage were associated with higher H3K4me3 
marks at the oblong stage (Fig. 4A). Thus, the presence of nucleo- 
some anays at promoters before genome activation correlates with 
subsequent H3K4me3 acquisition and gene expression. To exclude 
the possibility that these associations are caused by histone modi- 
fications rather than nucleosome arrays at the 256-cell stage, we also 
generated ChlP-seq profiles for H3K4me3 at the 256-cell stage 
(Supplemental Table SI). Consistent with our previous results 
(Vastenhouw et al. 2010) and corroborated by analyzing data from 
an independent study (Supplemental Fig. S19A; Lindeman et al. 
201 1), signals of H3K4me3 were largely absent in promoters at the 
256-cell stage and less correlated with later gene activation 
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(Supplemental Fig. S19B). Taken together, the early presence of 
nucleosome anays is a better predictor of future gene activity than 
nucleosome depletion at promoters. 

Discussion 

We have exploited the transition from transcriptional silence to 
widespread gene expression during early zebrafish embryogenesis 
to determine how nucleosomes are organized at promoters. Our 
study provides three major insights. First, most well-positioned 
nucleosome arrays appear on promoters only during genome ac- 
tivation. The emergence of these well-positioned nucleosome ar- 
rays cannot be explained by DNA sequence preference. As nucle- 
osome organization is likely to be determined by a combination 
of multiple factors, more studies will be needed to identify DNA 
binding proteins and chromatin remodelers that are involved in 
the formation of nucleosome arrays during genome activation. 

Second, canonical nucleosome organization can form in- 
dependently of robust Pol II binding and active transcription, but 
correlates well with the presence of H3K4me3 marks. We favor 
a model wherein ATP-dependent activities are driving nucleosome 
array formation at the 5' ends of genes, as described in studies of in 
vitro nucleosome positioning in yeast (Zhang et al. 2011). Our 
results indicate that H3K4me3 is a better predictor of nucleosome 
array formation at the promoters than Pol II binding. Indeed, 
promoters marked by high H3K4me3 but no detectable Pol II 
binding can form well-positioned nucleosome arrays. Although it 
is impossible to exclude the possibility that very low, transient, and 
undetectable levels of Pol II binding contribute to nucleosome 
positioning, our analysis of data in mouse embryonic stem cells 
also suggests that H3K4me3 may be a better predictor of nucleo- 
some array formation than Pol II binding. We suggest that genome 
activation, but not transcription, is central to the organization of 
nucleosome arrays during early embryogenesis. 

Third, promoter nucleosome organization before genome acti- 
vation has predictive value for future gene activation. This observa- 
tion is reminiscent of the proposed roles of monovalent (H3K4me3) 
and bivalent (H3K4me3 and H3K27me3) domains in poising genes 
for activation (Bernstein et al. 2006; Vastenhouw and Schier 2012). It 
is tempting to speculate that pioneer factors, chromatin remodelers, 
and histone modifiers establish nucleosome arrays and H3K4me3 
marks at promoters before and during genome activation and prepare 
genes for subsequent transcriptional activation. 

Methods 

Zebrafish 

Zebrafish were maintained and raised under standard conditions. 
Wild-type embryos were collected at the one-cell stage, synchro- 
nized, and allowed to develop to the desired stage at 28°C. 

MNase digestion 

MNase digestion was performed essentially as previously described 
(He et al. 2010). Briefly, between 250 and 650 embryos were care- 
fully staged, dechorionated, and fixed in 1% formaldehyde for 3 
min at room temperature. Formaldehyde was quenched by adding 
glycine to a final concentration of 0.125 M. Embryos were rinsed 
three times in ice-cold PBS, immediately resuspended in cell lysis 
buffer (10 mM Tris-HCl at pH 7.5, 10 mM NaCl, 0.5% NP40), and 
lysed for 3-4 min on ice. Nuclei were collected by centrifugation, 
washed with ice-cold PBS, collected by centrifugation again, and 



264 Genome Research 



www.genome.org 



Nucleosome organization during genome activation 



resuspended in digestion buffer (50 mM Tris-HCl at pH7.5, 1 mM 
CaCl 2 , 0.2% Triton X-100). Samples were divided into 200 |xL ali- 
quots, prewarmed to 25 °C, and incubated with MNase (stored in 
MNase storage buffer [10 mM Hepes at pH 7.5, 100 mM NaCl, 
1 mM CaCl 2 , 50% glycerol] and diluted in MNase dilution buffer 
[50 mM Tris-HCl at pH 8, 10 mM NaCl , 126 mM CaCl 2 , 5% 
glycerol]). The concentration of MNase and time of incubation 
were optimized to obtain 80% mononucleosomes as determined 
by Bioanalyzer analysis (Supplemental Fig. SIB). 

If nucleosomes were directly sequenced, reactions were 
stopped by adding the samples to 200 jjlL MNase STOP and de- 
crosslinking buffer (1% SDS, 100 mM NaHC0 3 , 25 mM EDTA) in 
a 15-mL tube. Samples were then sonicated and centrifuged for 
10 min at 4°C, and the supernatant was incubated for 90 min at 
65 °C to reverse crosslinks. Upon precipitation, DNA was purified 
by a QIAquick PCR purification kit (Qiagen). 

If nucleosomes were used for ChIP, reactions were stopped by 
adding samples to 200 |xL MNase STOP buffer (10 mM Tris-HCl at 
pH 7.6, 20 mM EDTA) in a 15-mL tube. Samples were then soni- 
cated and dialyzed against ChIP buffer (27.8 mM Tris-HCl at pH 
7.7, 111 mM NaCl, 4.1 mM EDTA, 0.34% SDS, 0.8% Triton X-100) 
for 2 h at 4°C. After centrifugation, the supernatant (chromatin) 
was used in ChlPs as described below. 

Antibodies 

Antibodies used were H3K4me3 (Millipore 07-473), H3K27me3 
(Millipore 07-449), H3K36me3 (Abeam ab9050), and RNA Pol II 
(8WG16/Covance MMS-126R and ab5408). 

ChIP 

RNA Pol II ChlPs were performed essentially as previously de- 
scribed (Vastenhouw et al. 2010) with some minor modifications. 
Briefly, between 400 and 800 embryos were staged, dechorionated, 
and fixed in 1.85% formaldehyde and 12.5% DMSO for 15 min at 
room temperature. Formaldehyde was quenched by adding gly- 
cine to a final concentration of 0. 125 M. Embryos were rinsed three 
times in ice-cold PBS, immediately resuspended in cell lysis buffer 
(10 mM Tris-HCl at pH 7.5, 10 mM NaCl, 0.5% NP-40), and lysed 
for 15 min on ice. Nuclei were collected by centrifugation, resus- 
pended in nuclear lysis buffer (50 mM Tris-HCl at pH 7.5, 10 mM 
EDTA/1% SDS), and lysed for 10 min on ice. Samples were diluted 
three times in IP dilution buffer (16.7 mM Tris-HCl at pH7.5, 167 mM 
NaCl, 1.2 mM EDTA, 0.01% SDS), and sonicated to obtain fragments 
of —400 bp. Triton X-100 was added to a final concentration of 0.75%. 
Lysate was incubated overnight, with rotating, at 4°C with 25 jjlL of 
protein G magnetic Dynabeads (Invitrogen) that had been prebound 
to an excess amount of antibody. Bound complexes were exten- 
sively washed with RIPA (50 mM HEPES at pH 7.6, 1 mM EDTA, 
0.7% DOC/1% Igepal, 0.5 M LiCl) and TBS, then eluted from the 
beads with elution buffer (50 mM NaHC0 3 , 1% SDS). Cross-links were 
reversed overnight at 65°C, and DNA was purified by a QIAquick 
PCR purification kit (Qiagen). For MNase/ChIP, between 300 and 
700 embryos were used per condition, and after dialysis, chromatin 
was added to the bead/antibody complex as described above. 

Library preparation and sequencing 

Libraries were prepared using the Illumina sequencing library 
preparation protocol and sequenced on an Illumina HiSeq 2000. 

Read mapping and nucleosome profiles 

All sequenced reads were mapped back to the zebrafish genome 
(zv9 assembly) using Bowtie. Uniquely mapped reads with a max- 
imum of two mismatches were kept for further analysis. The for- 



mula used to calculate the genomic coverage of nucleosomal se- 
quence reads is as follows: number of reads X 147 bp/genome size. 
To create nucleosome profiles, all mapped reads were extended to 
147 bp in their 3' direction, and the middle 73 bp were piled up, as 
previously described (Zhang et al. 2008). 

Nucleosome array value and well-positioned 
nucleosome arrays 

The 73rd base pair downstream from the 5' end of a nucleosomal 
sequence read is termed the midpoint of nucleosomal DNA. To 
capture the features of well-positioned nucleosome arrays, we dis- 
played the signal of nucleosome midpoints in a two-dimensional 
coordinate system with genomic positions on the *-axis and the 
number of nucleosome centers on the y-axis. Signal processing was 
performed as follows. First, Gaussian smoothing was applied on 
the signals with a 147-bp sliding window and 30 bp as the standard 
deviation. The difference of the signals on adjacent base pairs (i.e., 
the first-order derivative of the signal) was taken, then the absolute 
value of the difference on every base pair was calculated. If the 
value at a certain base pair was larger than any value within the 
[-73 bp, +73 bp] window, it was defined as a local maximum. All 
adjacent maxima were connected to create a curve spanning the 
whole genome. The height of the curve at each base pair is the 
nucleosome array value. The nucleosome array value for each 
promoter was then calculated by taking the average value of all 
base pairs in the promoter region. 

Based on nucleosome array values, well-positioned nucleo- 
some arrays were identified as follows. The average nucleosome 
array value of the whole genome was calculated and used as 
background, and the nucleosome array value at each base pair was 
transformed to a fold enrichment value over the background. This 
creates two types of segments: over background (fold enrichment 
larger than one) and under background (fold enrichment larger 
than one). Well-positioned nucleosome arrays are defined as over 
background segments with length >1000 bp and average fold en- 
richment larger than 1.75. 

Histone modification and Pol II status of genes 

For this analysis, we used a set of 48,373 previously annotated 
genes in the zebrafish genome (Pauli et al. 2012). To ensure un- 
ambiguous classification between high and low statuses, genes 
with moderate enrichment for histone modifications and Pol II 
were excluded from average profiles analyses. For H3K4me3, if 
a gene contained more than 10 ChlP-seq reads in its promoter 
region (defined as the region from 2 kb upstream of the TSS to 2 kb 
downstream from the TSS), and the average signal in its promoter 
was >4 RPKM (reads per kilobases per million reads), the H3K4me3 
status of the gene was considered high. If the average signal in its 
promoter was <1 RPKM, the H3K4me3 status of the gene was 
considered low. The H3K4me3 status for all other genes was con- 
sidered moderate. The H3K27me3 status was defined with the 
same criteria. For H3K36me3, if a gene contained more than 10 
reads in the last 2/3 regions of concatenated exons, and the average 
signal in the same region was > 4 RPKM, the H3K36me3 status of 
the gene was defined as high. If the number of reads in the average 
signal was <1 RPKM, the H3K36me3 status of the gene was low. The 
H3K36me3 status for all other genes was defined as moderate. For 
Pol II in promoters, if a gene contained more than 10 ChlP-seq 
reads in its promoter region, and the average signal in its promoter 
was >2.5 RPKM, the Pol II promoter status of the gene was con- 
sidered high. If the average signal in its promoter was <1 RPKM, the 
Pol II promoter status of the gene was considered low. The Pol II 
promoter status for all other genes was considered moderate. For 
Pol II in gene bodies, if the average signal in the gene body was >2 
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RPKM, the Pol II gene body status of the gene was considered high. 
If the average signal in the gene body was <1 RPKM, the Pol II gene 
body status of the gene was considered low. The Pol II gene body 
status for all other genes was considered moderate. 

Data access 

The DNA and RNA sequencing data of zebrafish from this study 
have been submitted to the NCBI Gene Expression Omnibus 
(GEO; http://www.ncbi.nlm.nih.gov/geo/) under accession number 
GSE44269. The DNA sequencing data of mouse from this study have 
been submitted to NCBI GEO under accession number GSE51766. 
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